Search for: All records

Creators/Authors contains: "Apley, Daniel W."

  1. Data-driven design shows promise for accelerating materials discovery but is challenged by the prohibitive cost of searching the vast design space of chemistry, structure, and synthesis methods. Bayesian optimization (BO) employs uncertainty-aware machine learning models to select promising designs to evaluate, thereby reducing that cost. However, BO with mixed numerical and categorical variables, which is of particular interest in materials design, has not been well studied. In this work, we survey frequentist and Bayesian approaches to uncertainty quantification in machine learning with mixed variables. We then conduct a systematic comparative study of their performance in BO using a popular representative model from each group: the random-forest-based Lolo model (frequentist) and the latent variable Gaussian process model (Bayesian). We examine the efficacy of the two models in the optimization of mathematical functions, as well as properties of structural and functional materials, and observe performance differences related to problem dimensionality and complexity. By investigating the machine learning models' predictive and uncertainty estimation capabilities, we provide interpretations of the observed performance differences. Our results offer practical guidance on choosing between frequentist and Bayesian uncertainty-aware machine learning models for mixed-variable BO in materials design.
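The mixed-variable BO loop described in this abstract can be sketched in a minimal, self-contained form. The code below is not the Lolo or LVGP implementation: it uses a plain zero-mean Gaussian process with a one-hot encoding of the categorical variable (a simpler alternative to the latent-variable mapping) and expected improvement as the acquisition function. The objective, candidate grid, and hyperparameters are all hypothetical.

```python
import numpy as np
from math import erf

# Hypothetical mixed-variable objective: one continuous knob x in [0, 1]
# and one categorical choice c in {0, 1, 2}; each category shifts the optimum.
def objective(x, c):
    shifts = [0.2, 0.5, 0.8]
    return (x - shifts[c]) ** 2

def encode(x, c, n_cats=3):
    # One-hot encode the categorical level and append the numeric variable.
    v = np.zeros(1 + n_cats)
    v[0] = x
    v[1 + c] = 1.0
    return v

def rbf(A, B, ls=0.3):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ls**2)

def gp_posterior(X, y, Xs, noise=1e-4):
    # Zero-mean GP regression posterior evaluated on the candidate set Xs.
    K = rbf(X, X) + noise * np.eye(len(X))
    Ks = rbf(X, Xs)
    sol = np.linalg.solve(K, Ks)
    mu = sol.T @ y
    var = np.clip(1.0 - np.einsum("ij,ij->j", Ks, sol), 1e-12, None)
    return mu, np.sqrt(var)

def expected_improvement(mu, sigma, best):
    # EI for minimization: E[max(best - f, 0)] under the GP posterior.
    z = (best - mu) / sigma
    Phi = np.array([0.5 * (1.0 + erf(zi / np.sqrt(2.0))) for zi in z])
    phi = np.exp(-0.5 * z**2) / np.sqrt(2.0 * np.pi)
    return (best - mu) * Phi + sigma * phi

# Candidate pool: a grid over x crossed with every categorical level.
cands = [(x, c) for x in np.linspace(0.0, 1.0, 21) for c in range(3)]
Xc = np.array([encode(x, c) for x, c in cands])

rng = np.random.default_rng(0)
idx = list(rng.choice(len(cands), size=4, replace=False))  # random initial designs
for _ in range(10):
    X = Xc[idx]
    y = np.array([objective(*cands[i]) for i in idx])
    mu, sigma = gp_posterior(X, y, Xc)
    ei = expected_improvement(mu, sigma, y.min())
    ei[idx] = -1.0                      # never re-evaluate a visited design
    idx.append(int(ei.argmax()))

best_x, best_c = min((cands[i] for i in idx), key=lambda p: objective(*p))
```

With 14 evaluations over 63 candidates, the loop reliably lands near one of the category-specific optima; the uncertainty estimate is what drives it to try under-explored categories before exploiting.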
  2. Many two-level nested simulation applications involve the conditional expectation of some response variable, where the expected response is the quantity of interest, and the expectation is with respect to the inner-level random variables, conditioned on the outer-level random variables. The latter typically represent random risk factors, and risk can be quantified by estimating the probability density function (pdf) or cumulative distribution function (cdf) of the conditional expectation. Much prior work has considered a naïve estimator that uses the empirical distribution of the sample averages across the inner-level replicates. This results in a biased estimator, because the distribution of the sample averages is over-dispersed relative to the distribution of the conditional expectation when the number of inner-level replicates is finite. Whereas most prior work has focused on allocating the numbers of outer- and inner-level replicates to balance the bias/variance tradeoff, we develop a bias-corrected pdf estimator. Our approach is based on the concept of density deconvolution, which is widely used to estimate densities with noisy observations but has not previously been considered for nested simulation problems. For a fixed computational budget, the bias-corrected deconvolution estimator allows more outer-level and fewer inner-level replicates to be used, which substantially improves the efficiency of the nested simulation. 
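The over-dispersion of the naive estimator is easy to reproduce numerically. In the sketch below (assumed setup: a standard normal risk factor with conditional expectation mu(Z) = Z and unit inner-level noise), the variance of the inner-level sample averages inflates to roughly 1 + 1/n_inner instead of the true value 1. A crude variance-matched shrinkage stands in for the paper's deconvolution estimator, which is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(1)
n_outer, n_inner = 20000, 8

# Outer level: risk factor Z; the conditional expectation is mu(Z) = Z,
# so the target distribution has variance exactly 1.
Z = rng.standard_normal(n_outer)

# Inner level: n_inner noisy replicates Y | Z ~ Normal(Z, 1) per scenario.
Y = Z[:, None] + rng.standard_normal((n_outer, n_inner))
Y_bar = Y.mean(axis=1)

# Naive estimator: treat the sample averages as draws of the conditional
# expectation. Their variance is over-dispersed by about 1/n_inner.
var_naive = Y_bar.var(ddof=1)        # roughly 1 + 1/8 = 1.125

# Crude variance-matched correction (illustration only): estimate the
# inner-noise contribution from the within-scenario spread and shrink the
# sample averages toward the grand mean to remove it.
noise_var = Y.var(axis=1, ddof=1).mean() / n_inner
shrink = np.sqrt((var_naive - noise_var) / var_naive)
corrected = Y_bar.mean() + shrink * (Y_bar - Y_bar.mean())
var_corrected = corrected.var(ddof=1)   # roughly 1.0
```

The gap between `var_naive` and `var_corrected` is exactly the finite-inner-replicate bias the abstract describes; the deconvolution approach corrects the whole density rather than just the variance.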
  3. Gaussian process (GP) models have been extended to emulate expensive computer simulations with both qualitative/categorical and quantitative/continuous variables. Latent variable (LV) GP models, which have recently been developed to map each qualitative variable to underlying numerical LVs, have strong physics-based justification and have achieved promising performance. Two versions place the LVs in Cartesian (LV-Car) space and hyperspherical (LV-sph) space, respectively. Despite their success, the effects of these different LV structures are still poorly understood. This article illuminates the issue with two contributions. First, we develop a theorem on the effect of the ranks of the qualitative-factor correlation matrices of mixed-variable GP models, from which we conclude that the LV-sph model restricts the interactions between the input variables and thus the types of response-surface data with which the model can be consistent. Second, adopting the rank-based perspective of the theorem, we propose a new model, LV-mix, that combines the LV-based correlation structures of the LV-Car and LV-sph models to achieve greater flexibility than either. Through extensive case studies, we show that LV-mix achieves higher average accuracy than both existing models.
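The latent-variable construction this abstract builds on can be sketched directly: each level of a qualitative factor is mapped to coordinates in a small Cartesian latent space (the LV-Car arrangement), and the factor's correlation matrix is a Gaussian kernel evaluated at those coordinates. The latent positions below are hypothetical; in a real LVGP they are estimated by maximum likelihood. Distinct latent positions yield a full-rank, positive-definite correlation matrix, which is the quantity the abstract's rank-based theorem reasons about.

```python
import numpy as np

def lv_correlation(latents):
    # Correlation between levels l and m: T[l, m] = exp(-||z_l - z_m||^2),
    # where z_l are the learned latent coordinates of level l.
    d2 = ((latents[:, None, :] - latents[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2)

# Hypothetical 2-D latent positions for a 4-level qualitative factor.
Z = np.array([[0.0, 0.0],
              [1.0, 0.0],
              [0.5, 0.8],
              [2.0, 0.3]])
T = lv_correlation(Z)

# T is a valid correlation matrix: unit diagonal, symmetric, positive definite,
# and full rank because the four latent points are distinct.
eigvals = np.linalg.eigvalsh(T)
rank = np.linalg.matrix_rank(T)
```

A more restrictive latent structure that collapses levels onto fewer distinct positions would reduce the rank of `T`, which in turn limits the response surfaces the mixed-variable GP can represent.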